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Universal screening of emotional and behavioral problems among students warrants further 
consideration by school professionals. School-based universal screening may provide opportu- 
nities for early identification and intervention, ultimately preventing the development of more 
severe problems and promoting more positive outcomes in the future. The Behavioral and Emo- 
tional Screening System (BESS) is a contemporary screening instrument that may be used to 
identify risk for emotional and behavioral problems in students from preschool to high school. 

The purpose of the present study was to examine the concurrent validity of the BESS in elemen- 
tary school settings. Specifically, this study examined the relation between BESS ratings and 
report-card outcomes (i.e., academic, behavioral, and engagement marks). The results supported 
the hypotheses that students’ risk-level classifications were significantly related to school-based 
outcome criterions and that school-based outcome criterions were deemed to be effective dis- 
criminators of students’ risk-level classification. Limitations, future directions for research, and 
implications for practice are discussed herein. 

Universal screening for students’ emotional and behavioral problems is becoming an increasingly 
important activity for school systems to consider. Given that students with emotional and behavioral 
problems have poor school-related outcomes (Rones & Hoagwood, 2000), school-based screening may 
provide opportunities for early identification and intervention, ultimately preventing the development of 
more severe problems and promoting more positive outcomes in the future (Dowdy, Furlong, Eklund, 
Saeki, & Ritchey, in press). However, despite its ameliorative potential, only about 2% of schools in 
the United States implement universal screening efforts (Romer & McIntosh, 2005). Considering that 
schools often function as the de facto mental health care system for students and adolescents (Rones & 
Hoagwood), the school context affords a unique opportunity to systematically identify and provide sup- 
port services for students with emotional and behavioral problems. 

Screening and School Psychology 

As data-based advocates for students, school psychologists can help identify students with emotion- 
al and behavioral risk by advocating for and implementing universal screening within their local schools. 
As they embark on such efforts, school psychologists should be cognizant of several key considerations 
(Dowdy et ah, in press). First, universal screening should never be isolated - it should always be inte- 
grated within a larger student-support framework. Second, screening efforts should always be accom- 
panied by well-defined objectives, ineluding progress monitoring and service provision aims. Third, the 
pragmatics of screening implementation - who, when, and where - must be established through careful 
consideration and planning. And lastly, decisions must be made regarding which types of emotional and 
behavioral problems to screen for and, by extension, what screening instrument to use. 

To date, there are several research-based instruments for school psychologists to utilize, though 
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not all are appropriate for every school context. Thus, when selecting a screening instrument, it is rec- 
ommended that school psychologists first consider three key aspects (Glover & Albers, 2007): (a) the 
match between the screener, the objectives underlying screening, and the support system surrounding 
the screening process; (b) the technical adequacy - including sensitivity, specificity, positive and nega- 
tive predictive values, and the psychometric properties of the instrument; and (c) the social validity (i.e., 
practicality and feasibility) of using the screener and managing the screening process amidst typical 
school duties and circumstances (e.g., Caldarella, Young, Richardson, Young, & Young, 2008). Mindful 
of the aforementioned considerations, as well as the nature of school psychological services. Dowdy et 
al. (in press) recommend four instruments as potentially useful for school-based universal screening: 
the Strengths and Difficulties Questionnaire (Goodman, 1997), Pediatric Symptom Checklist (Little, 
Murphy, Jellinek, Bishop, & Arnett, 1994), Systematic Screening for Behavior Disorders (Walker & Se- 
verson, 1992), and the Behavior Assessment System for Children-2 Behavioral and Emotional Screening 
System (BASC-2 BESS; Kamphaus & Reynolds, 2007). 

BASC-2 Behavioral and Emotional Screening System 

The BESS (Kamphaus & Reynolds, 2007) is the most recent and least-researched of the four recom- 
mended screening instruments. It is used to identify emotional and behavioral strengths and weaknesses 
in students from preschool to high school, assessing both internalizing and externalizing problems as 
well as school-related difficulties and adaptive skills. It has three parallel report forms - student, parent, 
and teacher - each composed of 25-30 items, designed to be completed in 5 minutes or less. The majority 
of items comprising the BESS stem from the item pool created during the development of the BASC-2 
Teacher Rating Scales, Parent Rating Scales, and Self-Report of Personality (Reynolds & Kamphaus, 
2004), with a range of one-to-eight new items added to each form. Similar to the other BASC-2 assess- 
ments, there are four response options for each item (i.e., never, sometimes, often, almost always); but 
dissimilar from the other assessments, the BESS only produces a single score. This score is conceptual- 
ized as the student’s risk-level classification for emotional and behavioral problems and can fall within 
the range of one of three categories: normal, elevated, or extremely elevated. Although the BESS has yet 
to garner substantial empirical support, its characteristics suggest it will be a promising tool for school 
psychological practice. However, to ensure its effeetiveness and social validity, investigations must first 
demonstrate its school-based, criterion-related validity - requiring concurrent and predictive evidence. 

To date, only two published studies have examined the criterion-related validity of the BESS in 
school settings. The first, eondueted by Kamphaus et al. (2007), used 2-year longitudinal data from 
students in K-5 grades to evaluate the predictive validity of the screener against a variety of behavioral 
and educational outcomes. Overall, correlations indicated that the screener was particularly good at 
predicting future teacher ratings of conduct problems, atypicality, and social skills; future indices of 
school maladjustment, special education placement, and referral for prereferral intervention; and read- 
ing and mathematics grades and standardized test scores. The second study, conducted by DiStefano 
and Kamphaus (2007), used longitudinal data from preschool students to evaluate the concurrent and 
predictive validity of the screener with various diagnostic and educational outcomes in kindergarten. 
Overall, correlations indicated that BESS scores were significantly related to concurrent assessments of 
students’ behavioral symptoms and school readiness as well as predictive assessments of students’ disci- 
plinary infractions; grades for reading, social development, and work habits; BASC-2 teacher-reported 
subscales for externalizing, internalizing, adaptability, school problems, and behavioral symptoms; and 
standardized testing scores for reading and math. Given the limited nature of this research to date, ad- 
ditional studies are warranted to further examine the validity of the BESS. 
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Present Study 

The purpose of the present study was to further evaluate the concurrent validity of the BESS in 
elementary school settings. Speeifically, the intent was to examine the relation between teaehers’ BESS 
ratings with students’ recent report-card outcomes. Overall, there were three hypotheses in this study: 

1. Students’ risk-level elassifieations - derived from sereening results and grouped herein as either 
“normal” or “at-risk” - eould be signifieantly eorrelated with their aeademie, engagement, and 
behavioral outcomes, as graded by their teachers. 

2. There would be signifieant mean differenees between “normal” and “at-risk” students’ aeademie, 
behavioral, and engagement outcomes, showing the relevance of the BESS to school-based indi- 
cators. 

3. Academic, engagement, and behavioral report-card criterions would effectively discriminate be- 
tween students identified as “normal” and “at-risk” via sereening results. 

METHODS 


Participants 

Participants were 26 third-graders and 22 fourth-graders from two elementary schools in a suburban 
community, within the same school district, located on California’s central coast. During the 2008-2009 
school year, the total enrollment of one school was 286 students and the total enrollment of the other was 
421 students. During that time, the demographic make up of both schools was comparable, with approxi- 
mately 73% of students identifying as Hispanic or Latino, 18% as White, and 9% as other or multiple 
ethnic groups. Approximately 68% of the students were elassified as socioeeonomically disadvantaged, 
40% as English language learners, and 14% as students with disabilities. Using class-wide data collec- 
tion procedures, the demographics of the participants in the present study (N = 48) were representative 
of the student population in these schools. 

Measures 

BESS teacher form. The BESS teacher form (child/adolescent version) is completed by teachers of 
students in grades K-12 (Kamphaus & Reynolds, 2007). It consists of 25 items and is designed to be 
completed in 5 minutes or less per student. The screener is scored by summing the items to generate a 
total T-score, with lower scores (20-60) reflecting a “normal” level of risk, higher scores (61-70) reflect- 
ing an “elevated” level of risk, and still higher scores (7 1 or above) reflecting an “extremely elevated” 
level of risk. The BESS teacher form was developed and normed with a sample of 12,350 accompany- 
ing parent and student forms, derived from participants in 233 cities across 40 states. Results from the 
norming process indicate that the psychometric properties of the BESS (across all forms) are generally 
acceptable, having good split-half reliability (.90-. 96), test-retest reliability (.80-. 91), inter-rater reliabil- 
ity (.71-. 83), sensitivity (.44-. 82), and specificity (.90-. 97). Furthermore, the measure has also proven 
to have acceptable convergent validity with the Achenbach System of Empirically Based Assessment 
(.71-. 77), Conner’s Rating Scales (.51-. 78), Vineland Adaptive Behavior Scales (.32-. 69), Children’s 
Depression Inventory (.51), and the Revised Children’s Manifest Anxiety Scale (.55). 

Report cards. Students’ report cards consisted of academic, engagement, and behavioral indicators, 
graded by their teachers. The academic indicators comprised 6 total subject areas - listening, reading, 
writing, math, history, and science - and corresponded to California state educational standards. Each 
indicator was graded on a scale of 1 to 4 (1 = has difficulty with standard, 2 = approaches standard, 3 = 
meets and applies standard, 4 = exceeds standard), indicating teachers’ perceptions of students’ present 
levels of achievement. Eor the purposes of this study, each subject area was conceptualized as a sub- 
composite, making up a total Academic Achievement composite. A behavioral engagement indicator 
accompanied each subject area, wherein the teachers graded the amount of “effort” students exhibited in 
meeting academic standards, using the same grading scale. Because these engagement indicators were 
unidimensional and few in number, for the purposes of this study they were summed into a total Engage- 
ment composite. The report card also consisted of several behavioral indicators (e.g., “Eollows rules and 
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direction;” “Completes classwork;” “Works well in a group”), graded on a l-to-3 scale (1 = needs im- 
provement, 2 = satisfactory, 3 = excellent). Using the same rationale as the engagement indicators, these 
behavioral indicators were summed into a total Behavioral Performance composite. 

Procedures 

During the first quarter of the school year, the BESS teacher form was completed for all the third- 
graders attending one school and all the fourth-graders attending the other school. For both grades com- 
bined, screening outcomes indicated that 70% of students were in the normal range (n = 77), 18% were 
in the elevated range (n = 20), and 12% were in the extremely elevated range (n = 13). Thus, for the 
purposes of this study, students in the elevated and extremely elevated ranges were grouped together, 
resulting in dichotomized risk-level classification: normal (T-score of 20 to 60) or at-risk (T-score of 
61 and above). Using this classification method, screening results indicated that 20 third-graders and 
13 fourth-graders had BESS scores in the at-risk range. In an attempt to create matched groups, the 13 
at-risk fourth-graders were selected to participate in the study, matched with a random selection of 1 3 
normal fourth-graders. A random selection of 13 at-risk third-graders was then conducted, matched with 
a random selection of 13 normal third-graders. During the course of the study, however, 2 at-risk fourth- 
graders were transferred to another school, and so the matched pairs were reduced to 1 1 fourth-graders 
and 13 third-graders in each group (N = 48). 

Next, the sample participants’ first quarter report cards - graded within a few weeks of BESS 
completion - were examined and coded. The Listening, Reading, Writing, Math, History, and Science 
sub-composites were generated and weighted by summing the indicators associated with each subject 
area and then dividing that total by the respective number of indicators. The Academic Achievement, 
Engagement, and Behavioral Performance composites were derived via the same process as the sub- 
composites, using their respective indicators. Following data collection and preparation, the aforemen- 
tioned hypotheses were examined by conducting three sets of statistical analyses: bivariate correlations, 
a one-way ANOVA, and discriminant function analyses. All analyses utilized the previously described 
sub-composites or the general composites; no isolated indicators were included in the analyses. 

RESULTS 

Results indicated that teacher-rated BESS risk-level classification (i.e., either normal or at-risk) 
was significantly related with students’ concurrent academic, engagement, and behavioral outcomes, as 
reported on their report cards. Specifically, risk-level classification was significantly correlated with each 
academic sub-composite, contributing to a significant correlation with the overall Academic Achieve- 
ment composite. In addition, significant correlations were also found between risk level and the Engage- 
ment and Behavioral Performance composites. Furthermore, results from the one-way ANOVA indicated 
significant differences between the mean scores of the normal and at-risk students for each of the sub- 
composites as well as the overall Academic Achievement, Engagement, and Behavioral Performance 
composites. See Table 1 and Table 2 for statistical summaries of these results. 

Two separate discriminant function analyses were conducted to determine if the academic, engage- 
ment, and behavioral composites, derived from the recent report cards, were effective criterions for 
discriminating between students’ risk-level classification. Model 1 was theoretically driven, entering 
the Academic Achievement, Engagement, and Behavioral Performance composites as simultaneous dis- 
criminant criterions. Results revealed that this model accounted for approximately 43% of the variance 
between risk levels and was an effective discriminator for risk-level classification (x‘= 25.401, p = 
.000). Using this model, it was predicted that 75% of normal and 88% of at-risk students were classi- 
fied correctly. Model 2 was statistically driven, using the stepwise method to determine which of the 
three composites were the most salient discriminators for risk-level classification. The resulting model 
included only the Academic Achievement and Engagement composites and accounted for approximately 
43% of the variance between risk levels. Similar to the previous model, it was also deemed an effective 
discriminator for risk-level classification (x‘= 25.495, p = .000). Furthermore, using this latter model, it 
was predicted that 67% of normal and 88% of at-risk students were classified correctly. See Table 3 and 
Table 4 for summaries of these results. 
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TABLE 1: Correlations between BESS Risk Level and Report Card Outcomes 


Report Card Outcomes 


Correlation 
with Risk Level 


p-value 


Academic Sub-Composites 


Listening 

-.503 

.000 

Reading 

-.461 

.000 

Writing 

-.435 

.001 

M ath 

-.393 

.003 

History 

-.528 

.000 

Science 

-.674 

.000 

eneral Composites 



Academic Achievement 

-.549 

.000 

E ngagement ( B ehavi oral ) 

-.614 

.000 

Behavioral Performance 

-.507 

.000 


TABLE 2; One- Way ANO VA for BESS Risk Level and Report Card Outcomes 


Report Card Outcomes 

F 

p-value 

Between Groups 


Academic Sub-Composites 

Listening 

15.545 

.000 

Reading 

12.440 

.001 

Writing 

10.745 

.002 

M ath 

8.424 

.006 

History 

17.769 

.000 

Science 

20.000 

.000 

General Composites 

Academic Achievement 

19.827 

.000 

Engagement (Behavioral) 

27.824 

.000 

Behavioral Performance 

15.947 

.000 
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TABLE 3: Discriminant Function Analyses: Model Validity 


Variables Wilk’sA, p-value 


M 0 del 1 (Theoretical)* .565 25.401 .000 

Academic Achievement 
Engagement 
Behavioral Performance 

M odel 2 (Statistical)** .567 25.495 .000 

Step 1: Engagement 
Step 2: Academic Achievement 


*A II variables were entered simultaneously. 
** Variables entered using stepwise method. 


TABLE 4: Discriminant Function Analyses: Group Membership Classifications 


Group 


Predicted Group Membership 
Normal At-Risk 


M odel 1 (Theoretical)* 


Normal 

18 

(75%) 

6 

(25%) 

At-Risk 

3 

(12%) 

21 

(88%) 

M odel 2 (Statistical)** 



Normal 

16 

(67%) 

8 

(33%) 

At-Risk 

3 

(12%) 

21 

(88%) 

* 80.1% of original grouped cases correctly classified. 
** 77.1% of original grouped cases correctly classified. 



DISCUSSION 


The purpose of the present study was to further examine the concurrent validity of the BESS in 
elementary school settings, by examining the relation between teachers’ BESS ratings and recent report- 
card outcomes. Consistent with the extant research, the results supported our hypotheses that students’ 
risk-level classifications would be significantly related to school-based outcome criterions and that such 
school-based outcome criterions, as reported via report cards, would be effective discriminators of stu- 


dents’ risk-level classification. 
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These results provide additional concurrent validity evidence for using the BESS in elementary 
school settings, showing that teacher ratings on the screener are highly related to their evaluations of stu- 
dents’ academic, engagement, and behavioral outcomes, as reported via report cards. Specifically, results 
revealed that BESS risk-level classification was moderately or highly correlated with all of the academic 
sub-composites as well as the three general composites; that there were significant mean differences be- 
tween normal and at-risk students on all sub-composites and composites; and that the report-card derived 
composites were effective discriminators for risk-level classification. Interestingly, however, results also 
suggest that similar discriminant results could be obtained by excluding the Behavioral Performance 
composite. Thus, such findings warrant further examination of which school-based criterions will pro- 
vide the best concurrent validity for the BESS. But in general, these findings add to the existing evidence 
supporting the BESS as an appropriate school-based screening instrument. 

Limitations 

Although the results of the present study supported the concurrent validity of the BESS, there are 
three limitations. Eirst, the sample characteristics were limited: only third- and fourth-graders partici- 
pated in this study; the students were all from the same district; and they identified as predominantly 
Hispanic or White. Thus, the results have low generalizability for students in other grades or locations 
and identifying with other ethnicities. Second, this study, similar to the previous research validating 
the BESS, focused only on teacher ratings. Given that the BESS also consists of parent- and self-rating 
components, it is unknown how these school-based criterions would relate to risk-level classifications 
derived from other raters. As such, these results should only be construed as supporting one facet of the 
overall BESS. Third, the school-based criterions used herein — ^report card outcomes — are idiosyncratic 
to the local school district. Other districts within other states have varying report-card indicators; thus, 
further replication is warranted. 

Future Directions 

Despite limitations, findings from this study suggest several directions for future research. Fore- 
most, the significant relations observed between BESS risk-level classification and local report-card 
criterions warrant further examination with various other report-card indicators, across grade levels, 
within varying school contexts, and using other behavioral indicators. To enhance the criterion-related 
validity, such outcomes could be used for both concurrent and predictive evidence. Furthermore, the 
difference in validity coefficients between local school criterions (e.g., report card grades) and global 
school criterions (e.g., standardized testing results and other BASC-2 ratings) also warrants evaluation. 
Previous validation studies (DiStefano & Kamphaus, 2007; Kamphaus et al., 2007) have focused primar- 
ily on global school criterions, providing limited information on the criterions inherent within the local 
school context. And lastly, future research may also benefit from utilizing further discriminant analyses, 
attempting to determine which types of criterions are the most effective for discriminating between stu- 
dents’ risk-level classification. 

Implications for Practice 

Considering the results of the present study in conjunction with the existing analyses of the BESS, 
the results are promising. The psychometric qualities and school-based validity of the BESS appear suit- 
able for using the instrument as a universal screener for students, seeking to identify those at-risk for 
potential emotional or behavioral problems. As noted in the introduction, it is essential that universal 
screening should always be (a) integrated within a larger student-support framework, (b) accompanied 
by well-defined objeetives, and (c) be established through eareful consideration and planning (Dowdy 
et al., in press). By incorporating these elements and implementing a valid screening instrument like the 
BESS, school psychologists may ultimately enhance early identification and intervention efforts within 
their local school context - preventing the development of more severe problems and promoting more 
positive outcomes for students in the future. 
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